# Long video processing
Videomind 2B FT QVHighlights
BSD-3-Clause
VideoMind is a multimodal agent framework that improves video reasoning by emulating human-like cognitive processes.
Video-to-Text
Safetensors
yeliudev
20
0
Videochatonline 4B
MIT
VideoChat-Online is an online video understanding model based on Phi-3-vision-128k-instruct that targets video-text-to-text tasks.
Video-to-Text
MCG-NJU
61
0
Videollm Online 8b V1plus
MIT
VideoLLM-online is a multimodal large language model based on Llama-3-8B-Instruct that targets online video understanding and video-to-text generation.
Video-to-Text, English
chenjoya
1,688
23